Empirical and theoretical support for lenient learning
Authors
Abstract
Recently, an evolutionary model of Lenient Q-learning (LQ) has been proposed, providing theoretical guarantees of convergence to the global optimum in cooperative multi-agent learning. However, experiments reveal discrepancies between the predicted dynamics of the evolutionary model and the actual learning behavior of the Lenient Q-learning algorithm, which undermines its theoretical foundation. Moreover, the predicted behavior of the model turns out to be more desirable than the observed behavior of the algorithm. We propose a variant, Lenient Frequency Adjusted Q-learning (LFAQ), which inherits the theoretical guarantees and resolves this issue. The advantages of LFAQ are demonstrated by comparing the evolutionary dynamics of lenient vs non-lenient Frequency Adjusted Q-learning. In addition, we analyze the behavior, convergence properties and performance of these two learning algorithms empirically. The algorithms are evaluated in the Battle of the Sexes (BoS) and the Stag Hunt (SH), while compensating for intrinsic learning speed differences. Significant deviations arise from the introduction of leniency, leading to profound performance gains in coordination games against both lenient and non-lenient learners.
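The abstract does not spell out the update rules, so the following is only a minimal sketch of how leniency might be combined with a frequency-adjusted Q-update in a stateless two-player game. It assumes, as in the broader literature on lenient and frequency-adjusted learners, that a lenient agent collects `kappa` rewards for an action and learns from the maximum, and that the frequency adjustment scales the step size by `min(beta / x_i, 1)`, where `x_i` is the probability of the chosen action. All names (`boltzmann`, `lenient_reward`, `faq_update`, `run_self_play`), the parameter values, and the Stag Hunt payoffs in the usage note are hypothetical and are not taken from the paper.

```python
import math
import random


def boltzmann(q, tau):
    """Softmax policy over Q-values with temperature tau."""
    m = max(q)  # subtract the max for numerical stability
    w = [math.exp((v - m) / tau) for v in q]
    s = sum(w)
    return [v / s for v in w]


def lenient_reward(samples):
    """Leniency (assumed form): learn from the best of the collected rewards."""
    return max(samples)


def faq_update(q, action, reward, policy, alpha=0.1, beta=0.01):
    """Frequency-adjusted update (assumed form) for a stateless game.

    The step size is scaled by min(beta / x_i, 1), so rarely played
    actions are not updated more slowly than frequently played ones.
    """
    scale = min(beta / policy[action], 1.0)
    q[action] += scale * alpha * (reward - q[action])
    return q


def run_self_play(payoff, kappa=5, steps=2000, tau=0.1, seed=0):
    """Two lenient learners in a symmetric game; returns their final policies."""
    rng = random.Random(seed)
    n = len(payoff)
    qs = [[0.0] * n for _ in range(2)]
    # One reward buffer per agent and action, flushed after kappa samples.
    buffers = [[[] for _ in range(n)] for _ in range(2)]
    for _ in range(steps):
        pols = [boltzmann(q, tau) for q in qs]
        acts = [rng.choices(range(n), weights=p)[0] for p in pols]
        # Symmetric game: each agent is paid payoff[own action][other action].
        rewards = [payoff[acts[0]][acts[1]], payoff[acts[1]][acts[0]]]
        for k in range(2):
            buf = buffers[k][acts[k]]
            buf.append(rewards[k])
            if len(buf) == kappa:
                faq_update(qs[k], acts[k], lenient_reward(buf), pols[k])
                buf.clear()
    return [boltzmann(q, tau) for q in qs]
```

Calling `run_self_play([[4, 0], [3, 3]], kappa=5)` runs two such learners on an illustrative Stag Hunt matrix; setting `kappa=1` removes the leniency, which gives a rough way to compare lenient and non-lenient behavior under these assumptions.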
Similar resources
Establishing an Argument-Based Validity Approach for a Low-Stake Test of Collocational Behavior
Most of the validation studies conducted across varying test application contexts are usually framed within the traditional conceptualization of validity and therefore lack a comprehensive framework to focus on test score interpretations and test score use. This study aimed at developing and validating a collocational behavior test (CBT), drawing on Kane's argument-based approach to validity. F...
Cloud Computing; A New Approach to Learning and Learning
Introduction: The cloud computing and services, as a technological solution for developing educational services, can accelerate the provision and expansion of these highly useful services. This study intended to provide an overall picture of practical areas of learning services based on cloud computing teaching and learning equipment. Methods: This was a theoretical hybrid research study in whi...
The Effect of Four Different Types of Involvement Indices on Vocabulary Learning and Retention of EFL Learners
The purpose of the present study was to provide empirical support for the construct of the involvement load hypothesis (ILH) in an EFL context. To fulfill the purpose of the study, 4 intact groups consisting of 126 intermediate-level students participated in this experiment. In order to ensure that the participants were at the same level of English language proficiency, the Nelson test was adm...
Midlife crisis: a debate.
Without doubt, the midlife crisis is the most popular concept describing middle adulthood. Facing the limitation of the time until death, men in particular are believed to pause from actively pursuing their goals and review their achievements, take stock of what they have and have not yet accomplished, at times taking drastic measures to fulfill their dreams. This paper critically discusses the...
Reinforcement Learning in Multi-agent Games
This article investigates the performance of independent reinforcement learners in multiagent games. Convergence to Nash equilibria and parameter settings for desired learning behavior are discussed for Q-learning, Frequency Maximum Q value (FMQ) learning and lenient Q-learning. FMQ and lenient Q-learning are shown to outperform regular Q-learning significantly in the context of coordination ga...